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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Met Lys Arg Leu Gly Ala Val Val Leu Ala Leu 

5 10 



Leu Val Gly 
15 



Leu Leu Ala Gly Thr Ala Leu Ala Ala Pro Val Lys Pro Val Val 
20 25 30 

Arg Asn Asn Ala Val Gin Gin Lys Asn Tyr Gly Leu Leu Thr Pro 
35 40 45 

Gly Leu Phe Lys Lys Val Gin Arg Met Asn Trp Asn Gin Glu Val 
50 55 60 

Asp Thr Val lie Met Phe Gly Ser Tyr Gly Asp Arg Asp Arg Ala 
65 70 75 

Val Lys Val Leu Arg Leu Met Gly Ala Gin Val Lys Tyr Ser Tyr 
80 85 90 

Lys lie lie Pro Ala Val Ala Val Lys He Lys Ala Arg Asp Leu 
95 100 105 

Leu Leu He Ala Gly Met He Asp Thr Gly Tyr Phe Gly Asn Thr 
110 115 120 

Arg Val Ser Gly He Lys Phe He Gin Glu Asp Tyr Lys Val Gin 
125 130 135 

Val Asp Asp Ala Thr Ser Val Ser Gin He Gly Ala Asp Thr Val 
140 * 145 150 

Trp Asn Ser Leu Gly Tyr Asp Gly Ser Gly Val Val Val Ala He 
155 160 165 

Val Asp Thr Gly He Asp Ala Asn His Pro Asp Leu Lys Gly Lys 
170 175 180 

Val He Gly Trp Tyr Asp Ala Val Asn Gly Arg Ser Thr Pro Tyr 
185 190 195 

Asp Asp Gin Gly His Gly Thr His Val Ala Gly He Val Ala Gly 
200 205 210 

Thr Gly Ser Val Asn Ser Gin Tyr He Gly Val Ala Pro Gly Ala 
215 220 225 

Lys Leu Val Gly Val Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
230 235 240 

Val Ser Thr He He Ala Gly Val Asp Trp Val Val Gin Asn Lys 
245 250 255 

Asp Lys Tyr Gly He Arg Val He Asn Leu Ser Leu Gly Ser Ser 
260 265 270 

Gin Ser Ser Asp Gly Thr Asp Ser Leu Ser Gin Ala Val Asn Asn 
275 280 285 

Ala Trp Asp Ala Gly He Val Val Cys Val Ala Ala Gly Asn Ser 
290 295 300 

Gly Pro Asn Thr Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys 
305 310 315 

Val He Thr Val Gly Ala Val Asp Ser Asn Asp Asn He Ala Ser 
320 325 330 

Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu 
335 340 345 
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Val Val Ala Pro Gly Val Asp He He Ala Pro Arg Ala Ser Gly 

350 355 360 

Thr Ser Met Gly Thr Pro He Asn Asp Tyr Tyr Thr Lys Ala Ser 

365 370 375 

Gly Thr Ser Met Ala Thr Pro His Val Ser Gly Val Gly Ala Leu 

380 385 390 

He Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys Thr 

395 400 405 

Ala Leu He Glu Thr Ala Asp He Val Ala Pro Lys Glu He Ala 

410 415 420 

Asp He Ala Tyr Gly Ala Gly Arg Val Asn Val Tyr Lys Ala He 

425 430 435 

Lys Tyr Asp Asp Tyr Ala Lys Leu Thr Phe Thr Gly Ser Val Ala 

440 445 450 

Asp Lys Gly Ser Ala Thr His Thr Phe Asp Val Ser Gly Ala Thr 

455 460 465 

Phe Val Thr Ala Thr Leu Tyr Trp Asp Thr Gly Ser Ser Asp He 

470 .475 480 

Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Glu Val Asp Tyr Ser 

485 490 495 

Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro 

500 505 510 

Thr Ala Gly Thr Trp Thr Val Lys Val Val Ser Tyr Lys Gly Ala 

515 520 525 

Ala Asn Tyr Gin Val Asp Val Val Ser Asp Gly Ser Leu Ser Gin 

530 535 540 

Ser Gly Gly Gly Asn Pro Asn Pro Asn Pro Asn Pro Asn Pro Thr 

545 550 555 

Pro Thr Thr Asp Thr Gin Thr Phe Thr Gly Ser Val Asn Asp Tyr 

560 565 570 

Trp Asp Thr Ser Asp Thr Phe Thr Met Asn Val Asn Ser Gly Ala 

575 580 585 

Thr Lys He Thr Gly Asp Leu Thr Phe Asp Thr Ser Tyr Asn Asp 

590 595 600 

Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Leu Val Asp Arg 

605 610 615 

Ser Thr Ser Ser Asn Ser Tyr Glu His Val Glu Tyr Ala Asn Pro 

620 625 630 

Ala Pro Gly Thr Trp Thr Phe Leu Val Tyr Ala Tyr Ser Thr Tyr 

635 640 645 

Gly Trp Ala Asp Tyr Gin Leu Lys Ala Val Val Tyr Tyr Gly 
650 655 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ATGAAGAGGT TAGGTGCTGT GGTGCTGGCA CTGGTGCTCG TGGGTCTTCT GGCCGGAACG 6 0 
GCCCTTGCGG CACCCGTAAA ACCGGTTGTC AGGAACAACG CGGTTCAGCA GAAGAACTAC 120 
GGACTGCTGA CCCCGGGACT GTTCAAGAAA GTCCAGAGGA TGAACTGGAA CCAGGAAGTG 180 
GACACCGTCA TAATGTTCGG GAGCTACGGA GACAGGGACA GGGCGGTTAA GGTACTGAGG 24 0 
CTCATGGGCG CCCAGGTCAA GTACTCCTAC AAGATAATCC CTGCTGTCGC GGTTAAAATA 3 00 
AAGGCCAGGG ACCTTCTGCT GATCGCGGGC ATGATAGACA CGGGTTACTT CGGTAACACA 3 60 
AGGGTCTCGG GCATAAAGTT CATACAGGAG GATTACAAGG TTCAGGTTGA CGACGCCACT 420 
TCCGTCTCCC AGATAGGGGC CGATACCGTC TGGAACTCCC TCGGCTACGA CGGAAGCGGT 4 80 
GTGGTGGTTG CCATCGTCGA TACGGGTATA GACGCGAACC ACCCCGATCT GAAGGGCAAG 54 0 
GTCATAGGCT GGTACGACGC CGTCAACGGC AGGTCGACCC CCTACGATGA CCAGGGACAC 6 00 
GGAAC CC ACG TTGCGGGTAT CGTTGCCGGA ACCGGCAGCG TTAACTCCCA GTACATAGGC 6 60 
GTCGCCCCCG GCGCGAAGCT CGTCGGCGTC AAGGTTCTCG GTGCCGACGG TTCGGGAAGC 720 
GTCTCCACCA TCATCGCGGG TGTTGACTGG GTCGTCCAGA ACAAGGACAA GTACGGGATA 78 0 
AGGGT CATC A ACCTCTCCCT CGGCTCCTCC CAGAGCTCCG ACGGAACCGA CTCCCTCAGT 840 
CAGGC CGTCA ACAACGCCTG GGACGCCGGT AT AGTAGTC T GCGTCGCCGC CGGCAACAGC 9 00 
GGGCCGAACA CCTACACCGT CGGCTCACCC GCCGCCGCGA GCAAGGTCAT AACCGTCGGT 960 

G CAGTTG AC A GCAACGACAA CATCGCCAGC TTCTCCAGCA GGGGACCGAC CGCGGACGGA 1020 

AGGCTCAAGC CGGAAGTCGT CGCCCCCGGC GTTGACATCA TAGCCCCGCG CGCCAGCGGA 1080 

ACCAGCATGG GCACCCCGAT AAACGACTAC TACACCAAGG CCTCTGGAAC CAGCATGGCC 1140 

ACCCCGCACG TTTCGGGCGT TGGCGCGCTC ATCCTCCAGG CCCACCCGAG CTGGACCCCG 12 00 

GACAAGGTGA AGACCGCCCT CATCGAGACC GCCGACATAG TCGCCCCCAA GGAGATAGCG 12 60 

GACATCGCCT ACGGTGCGGG T AGGGT G AAC GTCTACAAGG CCATCAAGTA CGACGACTAC 13 20 

GCCAAGCTCA CCTTCACCGG CTCCGTCGCC GACAAGGGAA GCGCCACCCA CACCTTCGAC 13 80 

GTCAGCGGCG CCACCTTCGT GACCGCCACC CTCTACTGGG ACACGGGCTC GAGCGACATC 144 0 

GACCTCTACC TCTACGACCC CAACGGGAAC GAGGTTGACT ACTCCTACAC CGCCTACTAC 15 00 

GGCTT CG AG A AGGTCGGCTA CTACAACCCG ACCGCCGGAA CCTGGACGGT CAAGGTCGTC 1560 

AGCTACAAGG GCGCGGCGAA CTACCAGGTC GACGTCGTCA GCGACGGGAG CCTCAGCCAG 1620 

TCCGG CGGCG GCAACCCGAA TCCAAACCCC AACCCGAACC CAACCCCGAC CACCGACACC 16 8 0 

CAG AC CTTC A CCGGTTCCGT TAACGACTAC TGGGACACCA GCGACACCTT CACCATGAAC 174 0 
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GTCAACAGCG GTGCCACCAA GATAACCGGT GACCTGACCT TCGATACTTC CTACAACGAC 18 00 

CTCGACCTCT ACCTCTACGA CCCCAACGGC AACCTCGTTG ACAGGTCCAC GTCGAGCAAC 1860 

AGCTACGAGC ACGTCGAGTA CGCCAACCCC GCCCCGGGAA CCTGGACGTT CCTCGTCTAC 1920 

GCCTACAGCA CCTACGGCTG GGCGGACTAC CAGCTCAAGG CCGTCGTCTA CTACGGG 197 7 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(D) OTHER INFORMATION: /note= Xaa at position 428 is Gly or Val . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Ala Glu Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala 

5 10 15 

Thr Tyr Val Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie Thr lie 
20 25 30 

Gly lie lie Asp Thr Gly lie Asp Ala Ser His Pro Asp Leu Gin 
35 40 45 

Gly Lys Val lie Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr 
50 55 60 

Pro Tyr Asp Asp His Gly His Gly Thr His Val Ala Ser lie Ala 
65 70 75 

Ala Gly Thr Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala 
80 85 90 

Pro Gly Ala Lys Leu Ala Gly lie Lys Val Leu Gly Ala Asp Gly 
95 100 105 

Ser Gly Ser lie Ser Thr lie lie Lys Gly Val Glu Trp Ala Val 
110 115 120 

Asp Asn Lys Asp Lys Tyr Gly lie Lys Val lie Asn Leu Ser Leu 
125 130 135 

Gly Ser Ser Gin Ser Ser Asp Gly Thr Asp Ala Leu Ser Gin Ala 
140 145 150 

Val Asn Ala Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala 
155 160 165 

Gly Asn Ser Gly Pro Asn Lys Tyr Thr lie Gly Ser Pro Ala Ala 
170 175 180 

Ala Ser Lys Val lie Thr Val Gly Ala Val Asp Lys Tyr Asp Val 
185 190 195 

lie Thr Ser Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu 
200 205 210 



* 
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Lys Pro Glu Val 



Val 
215 



Ala Pro Gly Asn 



Trp 
220 



He 



He Ala 



Ala 



Arg 
225 



Ala Ser Gly Thr Ser Met Gly Gin Pro He Asn Asp Tyr Tyr Thr 
230 235 240 

Ala Ala Pro Gly Thr Ser Met Ala Thr Pro His Val Ala Gly He 
245 250 255 

Ala Ala Leu Leu Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys 
260 265 270 

Val Lys Thr Ala Leu He Glu Thr Ala Asp He Val Lys Pro Asp 
275 280 285 

Glu He Ala Asp He Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr 
290 295 300 

Lys Ala He Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly 
305 310 315 

Tyr Val Ala Asn Lys Gly Ser Gin Thr His Gin Phe Val He Ser 
320 325 330 

Gly Ala Ser Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala. Asn 
335 340 345 

Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gin Val 
350 355 360 

Asp Tyr Ser Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr 
365 370 375 

Tyr Asn Pro Thr Asp Gly Thr Trp Thr He Lys Val Val Ser Tyr 
380 385 390 

Ser Gly Ser Ala Asn Tyr Gin Val Asp Val Val Ser Asp Gly Ser 
395 400 405 

Leu Ser Gin Pro Gly Ser Ser Pro Ser Pro Gin Pro Glu Pro Thr 
410 415 420 

Val Asp Ala Lys Thr Phe Gin Xaa Ser Asp His Tyr Tyr Tyr Asp 
425 430 435 

Arg Ser Asp Thr Phe Thr Met Thr Val Asn Ser Gly Ala Thr Lys 
440 445 450 

He Thr Gly Asp Leu Val Phe Asp Thr Ser Tyr His Asp Leu Asp 
455 460 465 

Leu Tyr Leu Tyr Asp Pro Asn Gin Lys Leu Val Asp Arg Ser Glu 
470 475 480 

Ser Pro Asn Ser Tyr Glu His Val Glu Tyr Leu Thr Pro Ala Pro 
485 490 495 

Gly Thr Trp Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr Gly Trp 
500 505 510 

Ala Tyr Tyr Glu Leu Thr Ala Lys Val Tyr Tyr Gly 



515 



520 



(2) INFORMATION 



FOR 



SEQ ID NO: 4 : 



(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 1566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: genomic DNA 
(ix) FEATURE: 

(D) OTHER INFORMATION: /note= N at position 12 83 is G or T. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



GCAGAATTAG 


AAGGACTGGA 


TGAGTCTGCA 


GCTCAAGTTA 


TGGCAACTTA 


CGTTTGGAAC 


60 


TTGGGATATG 


ATGGTTCTGG 


AATCACAATA 


GGAATAATTG 


ACACTGGAAT 


TGACGCTTCT 


120 


CATCCAGATC 


TCCAAGGAAA 


AGTAATTGGG 


TGGGTAGATT 


TTGTCAATGG 


TAGGAGTTAT 


180 


CCATACGATG 


AC C ATGG AC A 


TGGAACTCAT 


GTAGCTTCAA 


TAGCAGCTGG 


TACTGGAGCA 


240 


GCAAGTAATG 


GCAAGTACAA 


GGGAATGGCT 


CCAGGAGCTA 


AGCTGGCGGG 


AATTAAGGTT 


300 


CTAGGTGCCG 


ATGGTTCTGG 


AAGCATATCT 


ACTATAATTA 


AGGGAGTTGA 


GTGGGCCGTT 


360 


GATAACAAAG 


ATAAGTACGG 


AATTAAGGTC 


ATTAATCTTT 


CTCTTGGTTC 


AAGCCAGAGC 


420 


TCAGATGGTA 


CTGACGCTCT 


AAGTCAGGCT 


GTTAATGCAG 


CGTGGGATGC 


TGGATTAGTT 


480 


GTTGTGGTTG 


CCGCTGGAAA 


CAGTGGACCT 


AACAAGTATA 


CAATCGGTTC 


TCCAGCAGCT 


540 


GCAAGCAAAG 


TTATTACAGT 


TGGAGCCGTT 


GACAAGTATG 


ATGTTATAAC 


AAGCTTCTCA 


600 


AGCAGAGGGC 


CAAC TGCAGA 


CGGCAGGCTT 


AAGCCTGAGG 


TTGTTGCTCC 


AGGAAACTGG 


660 


ATAATTGCTG 


CCAGAGCAAG 


TGGAACTAGC 


ATGGGTCAAC 


CAATTAATGA 


CTATTACACA 


720 


GCAGCTCCTG 


GGACATCAAT 


GGCAACTCCT 


CACGTAGCTG 


GTATTGCAGC 


CCTCTTGCTC 


780 


CAAGCACACC 


CGAGCTGGAC 


TCCAGACAAA 


GTAAAAACAG 


CCCTCATAGA 


AACTGCTGAT 


840 


ATCGTAAAGC 


CAGATGAAAT 


AGCCGATATA 


GCCTACGGTG 


CAGGTAGGGT 


TAATGCATAC 


900 


AAGGCTATAA 


ACTACGATAA 


CTATGCAAAG 


CTAGTGTTCA 


CTGGATATGT 


TGCCAACAAA 


960 


GGCAGCCAAA 


CTCACCAGTT 


CGTTATTAGC 


GGAGCTTCGT 


TCGTAACTGC 


CACATTATAC 


1020 


TGGGACAATG 


CCAATAGCGA 


CCTTGATCTT 


TACCTCTACG 


ATCCCAATGG 


AAACCAGGTT 


1080 


GACTACTCTT 


ACACCGCCTA 


CTATGGATTC 


GAAAAGGTTG 


GTTATTACAA 


CCCAACTGAT 


1140 


GGAACATGGA 


CAATTAAGGT 


TGTAAGCTAC 


AGCGGAAGTG 


CAAACTATCA 


AGTAGATGTG 


1200 


GTAAGTGATG 


GTTCCCTTTC 


ACAGCCTGGA 


AGTTCACCAT 


CTCCACAACC 


AGAACCAACA 


1260 


GTAGACGCAA 


AGACGTTCCA 


AGNATCCGAT 


CACTACTACT 


ATGACAGGAG 


CGACACCTTT 


1320 


ACAATGACCG 


TTAACTCTGG 


GGCTACAAAG 


ATTACTGGAG 


ACCTAGTGTT 


TGACACAAGC 


1380 


TACCATGATC 


TTGACCTTTA 


CCTCTACGAT 


CCTAACCAGA 


AGCTTGTAGA 


TAGATCGGAG 


1440 


AGTCCCAACA 


GCTACGAACA 


CGTAGAATAC 


TTAACCCCCG 


CCCCAGGAAC 


CTGGTACTTC 


1500 


CTAGTATATG 


CCTACTACAC 


TTACGGTTGG 


GCTTACTACG 


AGCTGACGGC 


TAAAGTTTAT 


1560 


TATGGC 












1566 



(2) INFORMATION FOR SEQ ID NO : 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly 

5 10 15 

Leu Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Glu 
20 25 30 

Gin Val Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly 
35 40 45 

Leu Phe Arg Lys lie Gin Lys Leu Asn Pro Asn Glu Glu lie Ser 
50 55 60 

Thr Val lie Val Phe Glu Asn His Arg Glu Lys Glu lie Ala Val 
65 70 75 

Arg Val Leu Glu Leu Met Gly Ala Lys Val Arg Tyr Val Tyr His 
80 85 90 

lie lie Pro Ala lie Ala Ala Asp Leu Lys Val Arg Asp Leu Leu 
95 100 105 

Val lie Ser Gly Leu Thr Gly Gly Lys Ala Lys Leu Ser Gly Val 
110 115 120 

Arg Phe lie Gin Glu Asp Tyr Lys Val Thr Val Ser Ala Glu Leu 
125 130 135 

Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala Thr Tyr Val 
140 145 150 

Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie Thr lie Gly lie lie 
155 160 165 

Asp Thr Gly He Asp Ala Ser His Pro Asp Leu Gin Gly Lys Val 
170 175 180 

He Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp 
185 190 195 

Asp His Gly His Gly Thr His Val Ala Ser He Ala Ala Gly Thr 
200 205 210 

Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly Ala 
215 220 225 

Lys Leu Ala Gly He Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
230 235 240 

He Ser Thr He He Lys Gly Val Glu Trp Ala Val Asp Asn Lys 
245 250 255 

Asp Lys Tyr Gly He Lys Val He Asn Leu Ser Leu Gly Ser Ser 
260 265 270 



Gin Ser Ser Asp 



Gly 



275 



Thr Asp Ser Leu Ser Gin Ala Val Asn 
280 



Asn 
285 



Ala Trp Asp Ala Gly lie Val Val Cys Val Ala Ala Gly Asn Ser 
290 295 300 

Gly Pro Asn Thr Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys 
305 310 315 

Val lie Thr Val Gly Ala Val Asp Ser Asn Asp Asn lie Ala Ser 
320 325 330 

Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu 
335 340 345 

Val Val Ala Pro Gly Val Asp lie lie Ala Pro Arg Ala Ser Gly 
350 355 360 

Thr Ser Met Gly Thr Pro lie Asn Asp Tyr Tyr Thr Lys Ala Ser 
365 370 375 

Gly Thr Ser Met Ala Thr Pro His Val Ser Gly Val Gly Ala Leu 
380 385 390 

lie Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys Thr 
395 400 405 

Ala Leu He Glu Thr Ala Asp He Val Ala Pro Lys Glu He Ala 
410 415 420 

Asp He Ala Tyr Gly Ala Gly Arg Val Asn Val Tyr Lys Ala He 
425 430 435 

Lys Tyr Asp Asp Tyr Ala Lys Leu Thr Phe Thr Gly Ser Val Ala 
440 445 450 

Asp Lys Gly Ser Ala Thr His Thr Phe Asp Val Ser Gly Ala Thr 
455 460 465 

Phe Val Thr Ala Thr Leu Tyr Trp Asp Thr Gly Ser Ser Asp He 
470 475 480 

Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Glu Val Asp Tyr Ser 
485 490 495 

Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro 
500 505 510 

Thr Ala Gly Thr Trp Thr Val Lys Val Val Ser Tyr Lys Gly Ala 
515 520 525 

Ala Asn Tyr Gin Val Asp Val Val Ser Asp Gly Ser Leu Ser Gin 
530 535 540 

Ser Gly Gly Gly Asn Pro Asn Pro Asn Pro Asn Pro Asn Pro Thr 
545 550 555 

Pro Thr Thr Asp Thr Gin Thr Phe Thr Gly Ser Val Asn Asp Tyr 
560 ■ 565 570 

Trp Asp Thr Ser Asp Thr Phe Thr Met Asn Val Asn Ser Gly Ala 
575 580 585 

Thr Lys He Thr Gly Asp Leu Thr Phe Asp Thr Ser Tyr Asn Asp 



590 



595 



600 
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Leu" Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Leu Val Asp Arg 
605 610 615 

Ser Thr Ser Ser Asn Ser Tyr Glu His Val Glu Tyr Ala Asn Pro 
620 625 630 

Ala Pro Gly Thr Trp Thr Phe Leu Val Tyr Ala Tyr Ser Thr Tyr 
635 640 645 

Gly Trp Ala Asp Tyr Gin Leu Lys Ala Val Val Tyr Tyr Gly 
650 655 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 





ATGAAGGGGC 


TGAAAGCTCT 


CATATTAGTG 


ATTTTAGTTC 


TAGGTTTGGT 


AGTAGGGAGC 


60 


sz 


GTAGCGGCAG 


CTCCAGAGAA 


GAAAGTTGAA 


CAAGTAAGAA 


ATGTTGAGAA 


GAACTATGGT 


120 




CTGCTAACGC 


CAGGACTGTT 


CAGAAAAATT 


CAAAAATTGA 


ATCCTAACGA 


GGAAATCAGC 


180 


In 


AC AGTAAT TG 


TATTTGAAAA 


CCATAGGGAA 


AAAGAAATTG 


CAGTAAGAGT 


TCTTGAGTTA 


240 




ATGGGTGCAA 


AAGTTAGGTA 


TGTGTACCAT 


ATTATACCCG 


CAATAGCTGC 


CGATCTTAAG 


300 


o 


GTTAGAGACT 


TACTAGTCAT 


CTCAGGTTTA 


ACAGGGGGTA 


AAGCTAAGCT 


TTC AG GTGTT 


360 




AGGTTTATCC 


AGGAAGACTA 


CAAAGTTACA 


GTTTCAGCAG 


AATTAGAAGG 


ACTGGATGAG 


420 




TCTGCAGCTC 


AAGTTATGGC 


AACTTACGTT 


TGGAACTTGG 


GATATGATGG 


TTCTGGAATC 


480 




ACAATAGGAA 


TAATTGACAC 


TGGAATTGAC 


GCTTCTCATC 


CAGATCTCCA 


AGGAAAAGTA 


540 




ATTGGGTGGG 


TAGATTTTGT 


CAATGGTAGG 


AGTTATCCAT 


ACGATGACCA 


TGGACATGGA 


600 




ACTCATGTAG 


CTTCAATAGC 


AGCTGGTACT 


GGAGCAGCAA 


GTAATGGCAA 


GTACAAGGGA 


660 




ATGGCTCCAG 


GAGCTAAGCT 


GGCGGGAATT 


AAGGTTCTAG 


GTGCCGATGG 


TTCTGGAAGC 


720 




ATATCTACTA 


TAATTAAGGG 


AGTTGAGTGG 


GCCGTTGATA 


ACAAAGATAA 


GTACGGAATT 


780 




AAGGTCATTA 


ATCTTTCTCT 


TGGTTCAAGC 


CAGAGCTCCG 


ACGGAACCGA 


CTCCCTCAGT 


840 




CAGGCCGTCA 


ACAACGCCTG 


GGACGCCGGT 


ATAGTAGTCT 


GCGTCGCCGC 


CGGCAACAGC 


900 




GGGCCGAACA 


CCTACACCGT 


CGGCTCACCC 


GCCGCCGCGA 


GCAAGGTCAT 


AACCGTCGGT 


960 




GCAGTTGACA 


GCAACGACAA 


CATCGCCAGC 


TTCTCCAGCA 


GGGGACCGAC 


CGCGGACGGA 


1020 




AGGCTCAAGC 


CGGAAGTCGT 


CGCCCCCGGC 


GTTGACATCA 


TAGCCCCGCG 


CGCCAGCGGA 


1080 




ACCAGCATGG 


GCACCCCGAT 


AAACGACTAC 


TACACCAAGG 


CCTCTGGAAC 


CAGCATGGCC 


1140 




ACCCCGCACG 


TTTCGGG CGT 


TGGCGCGCTC 


ATCCTCCAGG 


CCCACCCGAG 


CTGGACCCCG 


1200 




GACAAGGTGA 


AGACCGCCCT 


CATCGAGACC 


G C CG AC AT AG 


TCGCCCCCAA 


GG AG AT AG CG 


1260 




GACATCGCCT 


ACGGTGCGGG 


TAGGGTGAAC 


GTCTACAAGG 


CCATCAAGTA 


CGACGACTAC 


1320 
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GCCAAGCTCA CCTTCACCGG CTCCGTCGCC GACAAGGGAA GCGCCACCCA CACCTTCGAC 1380 

GTCAGCGGCG CCACCTTCGT GACCGCCACC CTCTACTGGG ACACGGGCTC GAGCGACATC 14 4 0 

GACCTCTACC TCTACGACCC CAACGGGAAC GAGGTTGACT ACTCCTACAC CGCCTACTAC 1500 

GGCTTCGAGA AGGTCGGCTA CTACAACCCG ACCGCCGGAA CCTGGACGGT CAAGGTCGTC 1560 

AGCTACAAGG GCGCGGCGAA CTACCAGGTC GACGTCGTCA GCGACGGGAG CCTCAGCCAG 1620 

TCCGGCGGCG GCAACCCGAA TCCAAACCCC AACCCGAACC CAACCCCGAC CACCGACACC 16 80 

CAGACCTTCA CCGGTTCCGT TAACGACTAC TGGGACACCA GCGACACCTT CACCATGAAC 174 0 

GTCAACAGCG GTGCCACCAA GATAACCGGT GACCTGACCT TCGATACTTC CTACAACGAC 1800 

CTCGACCTCT ACCTCTACGA CCCCAACGGC AACCTCGTTG ACAGGTCCAC GTCGAGCAAC 1860 

AGCTACGAGC ACGTCGAGTA CGCCAACCCC GCCCCGGGAA CCTGGACGTT CCTCGTCTAC 1920 

GCCTACAGCA CCTACGGCTG GGCGGACTAC CAGCTCAAGG CCGTCGTCTA CTACGGG 1977 

yp (2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 
4= (A) LENGTH: 4765 base pairs 

1=5: (B) TYPE: nucleic acid 

jjfl (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

UJ <ii) MOLECULE TYPE: genomic DNA 

q (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

J TTTAAATTAT AAGATATAAT CACTCCGAGT GATGAGTAAG ATACATCATT ACAGTCCCAA 60 

jE AATGTTTATA ATTGG AACG C AGTGAATATA CAAAATGAAT ATAACCTCGG AGGTGACTGT 120 

™f AGAATGAATA AGAAGGGACT TACTGTGCTA TTTATAGCGA TAATGCTCCT TTCAGTAGTT 180 

CCAGTGCACT TTGTGTCCGC AGAAACACCA CCGGTTAGTT CAGAAAATTC AACAACTTCT 24 0 

ATACTCCCTA ACCAACAAGT TGTGACAAAA GAAGTTTCAC AAGCGGCGCT TAATGCTATA 300 

ATGAAAGGAC AACCCAACAT GGTTCTTATA ATCAAGACTA AGGAAGGCAA ACTTGAAGAG 360 

GCAAAAACCG AGCTTGAAAA GCTAGGTGCA GAGATTCTTG ACGAAAATAG AGTTCTTAAC 4 20 

ATGTTGCTAG TTAAGATTAA GCCTGAGAAA GTTAAAGAGC TCAACTATAT CTCATCTCTT 4 80 

GAAAAAGCCT GGCTTAACAG AGAAGTTAAG CTTTCCCCTC CAATTGTCGA AAAGGACGTC 540 

AAGACTAAGG AGCCCTCCCT AGAACCAAAA ATGTATAACA GCACCTGGGT AATTAATGCT 6 00 

CTCCAGTTCA TCCAGGAATT TGGATATGAT GGTAGTGGTG TTGTTGTTGC AGTACTTGAC 660 

ACGGG AG TTG ATCCGAACCA TCCTTTCTTG AGCATAACTC CAGATGGACG CAGGAAAATT 720 

ATAGAATGGA AGGATTTTAC AGACGAGGGA TTCGTGGATA CATCATTCAG CTTTAGCAAG 780 

GTTGTAAATG GGACTCTTAT AATTAACACA ACATTCCAAG TGGCCTCAGG TCTCACGCTG 840 

AATGAATCGA CAGGACTTAT GGAATACGTT GTTAAGACTG TTTACGTGAG CAATGTGACC 900 

ATTGGAAATA TCACTTCTGC TAATGGCATC TATCACTTCG GCCTGCTCCC AGAAAGATAC 960 
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TTCGACTTAA ACTTCGATGG TGATCAAGAG GACTTCTATC CTGTCTTATT AGTTAACTCC 1020 

ACTGGCAATG GTTATGACAT TGCATATGTG G AT ACTG AC C TTGACTACGA CTTCACCGAC 1080 

GAAGTTCCAC TTGGCCAGTA CAACGTTACT TATGATGTTG CTGTTTTTAG CTACTACTAC 114 0 

GGTCCTCTCA ACTACGTGCT TGCAGAAATA GATCCTAACG GAGAATATGC AGTATTTGGG 12 00 

TGGGATGGTC ACGGTCACGG AACTCACGTA GCTGGAACTG TTGCTGGTTA CGACAGCAAC 1260 

AATGATGCTT GGGATTGGCT CAGTATGTAC TCTGGTGAAT GGG AAGTGTT CTCAAGACTC 1320 

TATGGTTGGG ATTATACGAA CGTTACCACA G AC AC CGTG C AGGGTGTTGC TCCAGGTGCC 1380 

CAAATAATGG CAATAAGAGT TCTTAGGAGT GATGGACGGG GTAGCATGTG GGATATTATA 1440 

GAAGGTATGA CATACGCAGC AACCCATGGT GCAGACGTTA TAAGCATGAG TCTCGGTGGA 1500 

AATGCTCCAT ACTTAGATGG TACTGATCCA GAAAGCGTTG CTGTGGATGA GCTTACCGAA 1560 

AAGTACGGTG TTGTATTCGT AATAGCTGCA GGAAATGAAG GTCCTGGCAT TAACATCGTT 1620 

GGAAGTCCTG GTGTTGCAAC AAAGGCAATA ACTGTTGGAG CTGCTGCAGT GCCCATTAAC 1680 

GTTGGAGTTT ATGTTTCCCA AGCACTTGGA TATCCTGATT ACTATGGATT CTATTACTTC 174 0 

CCCGCCTACA CAAACGTTAG AATAGCATTC TTCTCAAGCA GAGGGCCGAG AATAGATGGT 1800 

GAAATAAAAC CCAATGTAGT GGCTCCAGGT TACGGAATTT ACTCATCCCT GCCGATGTGG 186 0 

ATTGGCGGAG CTGACTTCAT GTCTGGAACT TCGATGGCTA CTCCACATGT CAGCGGTGTC 192 0 

GTTGCACTCC TCATAAGCGG GGCAAAGGCC GAGGGAATAT ACTACAATCC AGATATAATT 1980 

AAGAAGGTTC TTGAGAGCGG TGCAACCTGG CTTGAGGGAG ATCCATATAC TGGGCAGAAG 2 04 0 

TAC ACTG AG C TTGACCAAGG TCATGGTCTT GTTAACGTTA CCAAGTCCTG GGAAATC CTT 2100 

AAGGCTATAA ACGGCACCAC TCTCCCAATT GTTGATCACT GGGCAGACAA GTCCTACAGC 2160 

GACTTTGCGG AGTACTTGGG TGTGGACGTT ATAAGAGGTC TCTACGCAAG GAACTCTATA 222 0 

CCTGACATTG TCGAGTGGCA CATTAAGTAC GTAGGGGACA CGGAGTACAG AACTTTTGAG 22 8 0 

ATCTATGCAA CTGAGCCATG GATTAAGCCT TTTGTCAGTG GAAGTGTAAT TCTAGAGAAC 234 0 

AATAC CGAGT TTGTCCTTAG GGTGAAATAT GATGTAGAGG GTCTTGAGCC AGGTCTCTAT 24 00 

GTTGGAAGGA TAATCATTGA TGATCCAACA ACGCCAGTTA TTGAAGACGA GATCTTGAAC 2460 

ACAATTGTTA TTCCCGAGAA GTTCACTCCT GAGAACAATT ACACCCTCAC CTGGTATGAT 2 52 0 

ATTAATGGTC CAGAAATGGT GACTCACCAC TTCTTCACTG TGCCTGAGGG AGTGGACGTT 2580 

CTCTACGCGA TGACCACATA CTGGGACTAC GGTCTGTACA GACCAGATGG AATGTTTGTG 264 0 

TTCCCATACC AGCTAGATTA TCTTCCCGCT GCAGTCTCAA ATCCAATGCC TGGAAACTGG 2700 

GAGCTAGTAT GGACTGGATT TAACTTTGCA CCCCTCTATG' AGTCGGGCTT CCTTGTAAGG 2760 

ATTTACGGAG TAGAGATAAC TCCAAGCGTT TGGTACATTA ACAGGACATA CCTTGACACT 2820 

AACACTGAAT TCTCAATTGA ATTCAATATT ACTAACATCT ATGCCCCAAT TAATGCAACT 2880 

CTAATCCCCA TTGGCCTTGG AACCTACAAT GCGAGCGTTG AAAGCGTTGG TGATGGAGAG 294 0 

TTCTTCATAA AGGGCATTGA AGTTCCTGAA GGCACCGCAG AGTTGAAGAT TAGpATAGGC 3000 



in 
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AACCCAAGTG TTCCGAATTC AG ATC TAG AC TTGTACCTTT ATGACAGTAA AGGCAATTTA 3060 

GTGGCCTTAG ATGGAAACCC AACAGCAGAA GAAGAGGTTG TAGTTGAGTA TCCTAAGCCT 3120 

GGAGTTTATT CAATAGTAGT ACATGGTTAC AGCGTCAGGG ACGAAAATGG TAATCCAACG 3180 

ACAACCACCT TTGACTTAGT TGTTCAAATG ACCCTTGATA ATGGAAACAT AAAGCTTGAC 324 0 

AAAGACTCGA TTATTCTTGG AAGCAATGAA AGCGTAGTTG TAACTGCAAA CATAACAATT 3300 

GATAGAGATC ATCCTACAGG AGTATACTCT GGTATCATAG AGATTAGAGA TAATGAGGTC 3360 

TACCAGGATA CAAATACTTC AATTG CG AAA ATACCCATAA CTTTGGTAAT TGACAAGGCG 3420 

GACTTTGCCG TTGGTCTCAC ACCAGCAGAG GGAGTACTTG GAGAGGCTAG AAATT AC AC T 34 80 

CTAATTGTAA AGCATGCCCT AACACTAGAG CCTGTGCCAA ATGCTACAGT GATTATAGGA 3 54 0 

AACT AC AC CT ACCTCACAGA CGAAAACGGT ACAGTGACAT TCACGTATGC TCCAACTAAG 3600 

TTAGGCAGTG ATGAAATCAC AGTCATAGTT AAGAAAGAGA ACTTCAACAC ATTAGAGAAG 3660 

ACCTTCCAAA TCACAGTATC AGAGCCTGAA ATAACTGAAG AGGACATAAA TGAGCCCAAG 3720 

CTTG CAATGT CATCACCAGA AGCAAATGCT ACCATAGTAT CAGTTGAGAT GGAGAGTGAG 3 780 

GGTGGCGTTA AAAAGACAGT GACAGTGGAA ATAACTATAA ACGGAACCGC TAATGAGACT 3 840 

GCAACAATAG TGGTTCCTGT TCCTAAGAAG GCCGAAAACA TCGAGGTAAG TGGAGACCAC 3 900 

GTAATTTCCT ATAGTATAGA GGAAGGAGAG TACGCCAAGT ACGTTATAAT TACAGTGAAG 3 960 

TTTGCATCAC CTGTAACAGT AACTGTTACT TACACTATCT ATGCTGGCCC AAGAGTCTCA 4 02 0 

ATCTTGACAC TTAACTTCCT TGGCTACTCA TGGTACAGAC TATATTCACA GAAGTTTGAC 4 08 0 

GAATTGTACC AAAAGGCCCT TGAATTGGGA GTGGACAACG AGACATTAGC TTTAGCCCTC 414 0 

AGCTACCATG AAAAAGCCAA AGAGTACTAC GAAAAGGCCC TTGAGCTTAG CGAGGGTAAC 42 00 

ATAATCCAAT ACCTTGGAGA CATAAGACTA TTACCTCCAT TAAGACAGGC ATACATCAAT 4260 

GAAATGAAGG CAGTTAAGAT ACTGGAAAAG GCCATAGAAG AATTAGAGGG TGAAGAGTAA 4320 

TCTCCAATTT TTCCCACTTT TT CTTTT AT A ACATTCCAAG CCTTTTCTTA GCTTCTTCGC 43 80 

TCATTCTATC AGGAGTCCAT GGAGGATCAA AGGTAAG TT C AACCTCCACA TCTCTTACTC 4440 

CTGGGATTTC GAGTACTTTC TCCTCTACAG CTCTAAGAAG CCAGAGAGTT AAAGGACACC 4 500 

C AGGAGTTGT ' CATTGTCATC TTTATATATA CCGTTTTGTC AGGATTAATC TTTAGCTCAT 4 560 

AAATTAATCC AAGGTTTACA ACATCCATCC CAATTTCTGG GTCGATAACC TCCTTTAGCT 4620 

TTTCCAGAAT CATTTCTTCA GTAATTTCAA GGTTCTCATC TTTGGTTTCT CTCACAAACC 4 680 

CAATTTCAAC CTGCCTGATA CCTTCTAACT CCCTAAGCTT GTTATATATC TCCAAAAGAG 4 74 0 

TGGCATCATC AATTTTCTCT TTAAA 4 765 

(2) INFORMATION FOR SEQ ID NO : 8 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 



- 127 - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Asn Lys Lys Gly Leu Thr Val Leu Phe lie Ala lie Met Leu 

5 10 15 

Leu Ser Val Val Pro Val His Phe Val Ser Ala Glu Thr Pro Pro 
20 25 30 

Val Ser Ser Glu Asn Ser Thr Thr Ser lie Leu Pro Asn Gin Gin 
35 40 45 

Val Val Thr Lys Glu Val Ser Gin Ala Ala Leu Asn Ala He Met 
50 55 60 

Lys Gly Gin Pro Asn Met Val Leu He He Lys Thr Lys Glu Gly 
65 70 ' 75 

Lys Leu Glu Glu Ala Lys Thr Glu Leu Glu Lys Leu Gly Ala Glu 
80 85 90 

He Leu Asp Glu Asn Arg Val Leu Asn Met Leu Leu Val Lys He 
95 100 105 

Lys Pro Glu Lys Val Lys Glu Leu Asn Tyr He Ser Ser Leu Glu 
110 115 120 

Lys Ala Trp Leu Asn Arg Glu Val Lys Leu Ser Pro Pro He Val 
125 130 135 

Glu Lys Asp Val Lys Thr Lys Glu Pro Ser Leu Glu Pro Lys Met 
140 145 150 

Tyr Asn Ser Thr Trp Val He Asn Ala Leu Gin Phe He Gin Glu 
155 160 165 

Phe Gly Tyr Asp Gly Ser Gly Val Val Val Ala Val Leu Asp Thr 
170 175 180 

Gly Val Asp Pro Asn His Pro Phe Leu Ser He Thr Pro Asp Gly 
185 190 195 

Arg Arg Lys He He Glu Trp Lys Asp Phe Thr Asp Glu Gly Phe 
200 205 210 

Val Asp Thr Ser Phe Ser Phe Ser Lys Val Val Asn Gly Thr Leu 
215 220 225 

He He Asn Thr Thr Phe Gin Val Ala Ser Gly Leu Thr Leu Asn 
230 235 240 

Glu Ser Thr Gly Leu Met Glu Tyr Val Val Lys Thr Val Tyr Val 
245 - 250 255 

Ser Asn Val Thr He Gly Asn He Thr Ser Ala Asn Gly He Tyr 
260 265 270 

His Phe Gly Leu Leu Pro Glu Arg Tyr Phe Asp Leu Asn Phe Asp 
275 280 285 

Gly Asp Gin Glu Asp Phe Tyr Pro Val Leu Leu Val Asn Ser Thr 
290 295 300 
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Gly Asn Gly Tyr Asp lie Ala Tyr Val Asp Thr Asp Leu Asp Tyr 
305 310 315 

Asp Phe Thr Asp Glu Val Pro Leu Gly Gin* Tyr Asn Val Thr Tyr 
320 325 330 

Asp Val Ala Val Phe Ser Tyr Tyr Tyr Gly Pro Leu Asn Tyr Val 
335 340 345 

Leu Ala Glu lie Asp Pro Asn Gly Glu Tyr Ala Val Phe Gly Trp 
350 355 360 

Asp Gly His Gly His Gly Thr His Val Ala Gly Thr Val Ala Gly 
365 370 375 

Tyr Asp Ser Asn Asn Asp Ala Trp Asp Trp Leu Ser Met Tyr Ser 
380 385 . 390 

Gly Glu Trp Glu Val Phe Ser Arg Leu Tyr Gly Trp Asp Tyr Thr 
395 400 405 

Asn Val Thr Thr Asp Thr Val Gin Gly Val Ala Pro Gly Ala Gin 
410 415 420 

lie Met Ala lie Arg Val Leu Arg Ser Asp Gly Arg Gly Ser Met 
425 430 435 

Trp Asp lie lie Glu Gly Met Thr Tyr Ala Ala Thr His Gly Ala 
440 445 450 

Asp Val lie Ser Met Ser Leu Gly Gly Asn Ala Pro Tyr Leu Asp 
455 460 465 

Gly Thr Asp Pro Glu Ser Val Ala Val Asp Glu Leu Thr Glu Lys 
470 475 480 

Tyr Gly Val Val Phe Val He Ala Ala Gly Asn Glu Gly Pro Gly 
485 490 495 

He Asn He Val Gly Ser Pro Gly Val Ala Thr Lys Ala lie Thr 
500 505 510 

Val Gly Ala Ala Ala Val Pro He Asn Val Gly Val Tyr Val Ser 
515 520 525 

Gin Ala Leu Gly Tyr Pro Asp Tyr Tyr Gly Phe Tyr Tyr Phe Pro 
530 535 540 

Ala Tyr Thr Asn Val Arg He Ala Phe Phe Ser Ser Arg Gly Pro 
545 550 555 

Arg He Asp Gly Glu He Lys Pro Asn Val Val Ala Pro Gly Tyr 
560 565 570 

Gly He Tyr Ser Ser Leu Pro Met Trp He Gly Gly Ala Asp Phe 
575 580 585 

Met Ser Gly Thr Ser Met Ala Thr Pro His Val Ser Gly Val Val 
590 595 600 

Ala Leu Leu He Ser Gly Ala Lys Ala Glu Gly He Tyr Tyr Asn 
605 610 615 

Pro Asp He He Lys Lys Val Leu Glu Ser Gly Ala Thr Trp Leu 
620 625 630 



Glu' Gly Asp Pro Tyr Thr Gly Gin Lys Tyr Thr Glu Leu Asp Gin 
635 640 645 

Gly His Gly Leu Val Asn Val Thr Lys Ser Trp Glu lie Leu Lys 
650 655 660 

Ala lie Asn Gly Thr Thr Leu Pro He Val Asp His Trp Ala Asp 
665 670 675 

Lys Ser Tyr Ser Asp Phe Ala Glu Tyr Leu Gly Val Asp Val He 
680 685 690 

Arg Gly Leu Tyr Ala Arg Asn Ser He Pro Asp He Val Glu Trp 
695 700 705 

His He Lys Tyr Val Gly Asp Thr Glu Tyr Arg Thr Phe Glu He 
710 715 720 

Tyr Ala Thr Glu Pro Trp He Lys Pro Phe Val Ser Gly Ser Val 
725 730 735 

He Leu Glu Asn Asn Thr Glu Phe Val Leu Arg Val Lys Tyr Asp 
740 745 750 

Val Glu Gly Leu Glu Pro Gly Leu Tyr Val Gly Arg He He He 
755 760 765 

Asp Asp Pro Thr Thr Pro Val He Glu Asp Glu He Leu Asn Thr 
770 775 780 

He Val He Pro Glu Lys- Phe Thr Pro Glu Asn Asn Tyr Thr Leu 
785 790 795 

Thr Trp Tyr Asp He Asn Gly Pro Glu Met Val Thr His His Phe 
800 805 810 

Phe Thr Val Pro Glu Gly Val Asp Val Leu Tyr Ala Met Thr Thr 
815 820 825 

Tyr Trp Asp Tyr Gly Leu Tyr Arg Pro Asp Gly Met Phe Val Phe 
830 835 840 

Pro Tyr Gin Leu Asp Tyr Leu Pro Ala Ala Val Ser Asn Pro Met 
845 850 855 

Pro Gly Asn Trp Glu Leu Val Trp Thr Gly Phe Asn Phe Ala Pro 
860 865 870 

Leu Tyr Glu Ser Gly Phe Leu Val Arg He Tyr Gly Val Glu He 
875 880 885 

Thr Pro Ser Val Trp Tyr He Asn Arg Thr Tyr Leu Asp Thr Asn 
890 895 900 

Thr Glu Phe Ser He Glu Phe Asn He Thr Asn He Tyr Ala Pro 
905 910 915 

He Asn Ala Thr Leu He Pro He Gly Leu Gly Thr Tyr Asn Ala 
920 925 930 

Ser Val Glu Ser Val Gly Asp Gly Glu Phe Phe He Lys Gly He 
935 940 945 

Glu Val Pro Glu Gly Thr Ala Glu Leu Lys He Arg He Gly Asn 
950 955 960 
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Pro Ser Val Pro Asn Ser Asp Leu Asp Leu Tyr Leu Tyr Asp Ser 
965 970 975 

Lys Gly Asn Leu Val Ala Leu Asp Gly Asn Pro Thr Ala Glu Glu 
980 985 990 

Glu Val Val Val Glu Tyr Pro Lys Pro Gly Val Tyr Ser lie Val 
995 1000 1005 

Val His Gly Tyr Ser Val Arg Asp Glu Asn Gly Asn Pro Thr Thr 
1010 1015 1020 

Thr Thr Phe Asp Leu Val Val Gin Met Thr Leu Asp Asn Gly Asn 
1025 1030 1035 

lie Lys Leu Asp Lys Asp Ser lie lie Leu Gly Ser Asn Glu Ser 
1040 1045 1050 

Val Val Val Thr Ala Asn He Thr He Asp Arg Asp His Pro Thr 
1055 1060 1065 

Gly Val Tyr Ser Gly He He Glu He Arg Asp Asn Glu Val Tyr 
1070 1075 1080 

Gin Asp Thr Asn Thr Ser He Ala Lys He Pro He Thr Leu Val 
^ 1085 1090 1095 

*F He Asp Lys Ala Asp Phe Ala Val Gly Leu Thr Pro Ala Glu Gly 

Mb 1100 1105 1110 

Val Leu Gly Glu Ala Arg Asn Tyr Thr Leu He Val Lys His Ala 
PJ 1115 1120 1125 

- Leu Thr Leu Glu Pro Val Pro Asn Ala Thr Val He He Gly Asn 
^ 1130 1135 1140 

4= Tyr Thr Tyr Leu Thr Asp Glu Asn Gly Thr Val Thr Phe Thr Tyr 

fy 1145 1150 1155 

— Ala Pro Thr Lys Leu Gly Ser Asp Glu He Thr Val lie Val Lys 
J-f 1160 1165 1170 

Lys Glu Asn Phe Asn Thr Leu Glu Lys Thr Phe Gin He Thr Val 
1175 1180 1185 

Ser Glu Pro Glu He Thr Glu Glu Asp He Asn Glu Pro Lys Leu 
1190 1195 1200 

Ala Met Ser Ser Pro Glu Ala Asn Ala Thr He Val Ser Val Glu 
1205 1210 1215 

Met Glu Ser Glu Gly Gly Val Lys Lys Thr Val Thr Val Glu He 
1220 1225 1230 

Thr He Asn Gly Thr Ala Asn Glu Thr Ala Thr He Val Val Pro 
1235 1240 1245 

Val Pro Lys Lys Ala Glu Asn He Glu Val Ser Gly Asp His Val 
1250 1255 1260 

. He Ser Tyr Ser He Glu Glu Gly Glu Tyr Ala Lys Tyr Val He 
1265 1270 1275 

He Thr Val Lys Phe Ala Ser Pro Val Thr Val Thr Val Thr Tyr 
1280 1285 1290 
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Thr lie Tyr Ala Gly Pro Arg Val 
1295 

Leu Gly Tyr Ser Trp Tyr Arg Leu 
1310 

Leu Tyr Gin Lys Ala Leu Glu Leu 
1325 

Ala Leu Ala Leu Ser Tyr His Glu 
1340 

Lys Ala Leu Glu Leu Ser Glu Gly 
1355 

Asp lie Arg Leu Leu Pro Pro Leu 
1370 

Met Lys Ala Val Lys lie Leu Glu 
1385 

Gly Glu Glu 



(2) INFORMATION FOR SEQ ID NO : 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGWWS DRRTG TTRRHGTHGC DGTDMTYGAC ACBGG 35 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
KSTCACGGAA CTCACGTDGC BGGMACDGTT GC 32 



Ser lie Leu Thr Leu Asn Phe 
1300 1305 

Tyr Ser Gin Lys Phe Asp Glu 
1315 1320 

Gly Val Asp Asn Glu Thr Leu 
1330 1335 

Lys Ala Lys Glu Tyr Tyr Glu 
1345 1350 

Asn lie lie Gin Tyr Leu Gly 
1360 1365 

Arg Gin Ala Tyr lie Asn Glu 
1375 1380 

Lys Ala lie Glu Glu Leu Glu 
1390 1395 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ASCMGCAACH GTKCCVGCHA CGTGAGTTCC GTG 



33 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CHCCGSYVAC RTGBGGAGWD GCCATBGAVG TDCC 34 



(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

A GTT GCG GTA ATT GAC ACG GGT ATA GAC GCG AAC CAC CCC GAT CTG 4 6 

Val Ala Val lie Asp Thr Gly He Asp Ala Asn His Pro Asp Leu 

5 10 15 

AAG GGC AAG GTC ATA GGC TGG TAC GAC GCC GTC AAC GGC AGG TCG 91 
Lys Gly Lys Val He Gly Trp Tyr Asp Ala Val Asn Gly Arg Ser 
20 25 30 

ACC CCC TAC GAT GAC CAG GGA CAC GGA ACT CAC GTN GCN GGA ACN 136 
Thr Pro Tyr Asp Asp Gin Gly His Gly Thr His Val Ala Gly Thr 
35 40 45 

GTT GCT GGT 14 5 

Val Ala Gly 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TCT CAC GGA ACT CAC GTG GCG GGA ACA GTT GCC GGA ACA GGC AGC 4 5 

Ser His Gly Thr His Val Ala Gly Thr Val Ala Gly Thr Gly Ser 

5- 10 15 

GTT AAC TCC CAG TAC ATA GGC GTC GCC CCC GGC GCG AAG CTC GTC 90 
Val Asn Ser Gin Tyr He Gly Val Ala Pro Gly Ala Lys Leu Val 
20 25 30 

GGT GTC AAG GTT CTC GGT GCC GAC GGT TCG GGA AGC GTC TCC ACC 13 5 

Gly Val Lys Val Leu Gly Ala Asp Gly Ser Gly Ser Val Ser Thr 
35 40 45 
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ATC ATC GCG GGT GTT GAC TGG GTC GTC CAG AAC AAG GAT AAG TAC 180 

lie lie Ala Gly Val Asp Trp Val Val Gin Asn Lys Asp Lys Tyr 
50 55 60 

GGG ATA AGG GTC ATC AAC CTC TCC CTC GGC TCC TCC CAG AGC TCC 22 5 

Gly lie Arg Val lie Asn Leu Ser Leu Gly Ser Ser Gin Ser Ser 
65 70 75 

GAC GGA GCC GAC TCC CTC AGT CAG GCC GTC AAC AAC GCC TGG GAC 2 70 

Asp Gly Ala Asp Ser Leu Ser Gin Ala Val Asn Asn Ala Trp Asp 
80 85 90 

GCC GGT ATA GTA GTC TGC GTC GCC GCC GGC AAC AGC GGG CCG AAC 315 

Ala Gly lie Val Val Cys Val Ala Ala Gly Asn Ser Gly Pro Asn 
95 100 105 

ACC TAC ACC GTC GGC TCA CCC GCC GCC GCG AGC AAG GTC ATA ACC 3 60 

Thr Tyr Thr Val Gly Ser Pro Ala Ala Ala Ser Lys Val lie Thr 
110 115 120 

GTC GGT GCA GTT GAC AGC AAC GAC AAC ATC GCC AGC TTC TCC AGC 4 05 

Val Gly Ala Val Asp Ser Asn Asp Asn He Ala Ser Phe Ser Ser 
125 130 135 

AGG GGA CCG ACC GCG GAC GGA AGG CTC AAG CCG GAA GTC GTC GCC 4 50 

Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu Val Val Ala 
^ 140 145 150 

2 CCC GGC GTT GAC ATC ATA GCC CCG CGC GCC AGC GGA ACC AGC ATG 4 95 

Pro Gly Val Asp lie He Ala Pro Arg Ala Ser Gly Thr Ser Met 
155 160 165 

Ixl GGC ACC CCG ATA AAC GAC TAC TAC ACC AAG GCC TCT GGA ACC TCA 54 0 

Gly Thr Pro He Asn Asp Tyr Tyr Thr Lys Ala Ser Gly Thr Ser 
Z_ 170 175 180 

=E ATG GCC ACT CCC CAT GTT ACC GGT 564 

m Met Ala Thr Pro His Val Thr Gly 

^ 185 

U: (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1859 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GAGCTCCGAC GGAACCGACT CCCTCAGTCA GGCCGTCAAC AACGCCTGGG ACGCCGGTAT 60 
AGTAGTCTGC GTCGCCGCCG GCAACAGCGG GCCGAACACC TACACCGTCG GCTCACCCGC 120 



CGCCGCGAGC AAGGTCATAA CCGTCGGTGC AGTTGACAGC AACGACAACA TCGCCAGCTT 180 

CTCCAGCAGG GGACCGACCG CGGACGGAAG GCTCAAGCCG GAAGTCGTCG CCCCCGGCGT 24 0 

TGACATCATA GCCCCGCGCG C CAGCGG AAC CAG C ATGGGC ACCCCGATAA ACGACTACTA 300 

CACCAAGGCC TCTGGAACCA GCATGGCCAC CCCGCACGTT TCGGGCGTTG GCGCGCTCAT 3 60 

CCTCCAGGCC CACCCGAGCT GGACCCCGGA CAAGGTGAAG ACCGCCCTCA TCGAGACCGC 42 0 
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CGACATAGTC 


GCCCCCAAGG 


AGATAGCGGA 


CATCGCCTAC 


GGTGCGGGTA 


GGGTGAACGT 


430 




CTACAAGGCC 


ATCAAGTACG 


ACGACTACGC 


CAAGCTCACC 


TTCACCGGCT 


CCGTCGCCGA 


540 




CAAGGGAAGC 


GCCACCCACA 


CCTTCGACGT 


CAGCGGCGCC 


ACCTTCGTGA 


CCGCCACCCT 


600 




CTACTGGGAC 


ACGGGCTCGA 


GCGACATCGA 


CCTCTACCTC 


TACGACCCCA 


ACGGGAACGA 


660 




GGTTGACTAC 


TCCTACACCG 


CCTACTACGG 


CTTCGAGAAG 


GTCGGCTACT 


ACAACCCGAC 


720 




CGCCGGAACC 


TGGACGGTCA 


AGGTCGTCAG 


CTACAAGGGC 


GCGGCGAACT 


ACCAGGTCGA 


780 




CGTCGTCAGC 


GACGGGAGCC 


TCAGCCAGTC 


CGGCGGCGGC 


AACCCGAATC 


CAAACCCCAA 


840 




CCCGAACCCA 


ACCCCGACCA 


CCGACACCCA 


GACCTTCACC 


GGTTC CGTT A 


ACGACTACTG 


900 




GGACACCAGC 


GACACCTTCA 


CCATGAACGT 


CAACAGCGGT 


GCCACCAAGA 


TAACCGGTGA 


960 




CCTGACCTTC 


GATACTTCCT 


ACAACGACCT 


CGACCTCTAC 


CTCTACGACC 


CCAACGGCAA 


1020 




CCTCGTTGAC 


AGGTCCACGT 


CGAGCAACAG 


CTACGAGCAC 


GTCGAGTACG 


CCAACCCCGC 


1080 




CCCGGGAACC 


TGGACGTTCC 


TCGTCTACGC 


CTACAGCACC 


TACGGCTGGG 


CGGACTACCA 


1140 


GCTCAAGGCC 


GTCGTCTACT 


ACGGGTGAAG 


GTTTTTAATC 


CCCTTTTCTT 


TCCCCTTTTG 


1200 


Jz 


AGGTGGTTGG 


GATGAAGCGG 


GTTCTTGCGG 


CGATCCTTGT 


AATCATGCTC 


ATCGGATTAT 


1260 


s s 


CATTCCCTGC 


CGGAAGTGCT 


AAAATCGAGC 


CCTACGTTTA 


CAGCCCCACC 


GTTCCGGATA 


1320 


^ = 


CCGCCTTCGC 


GGTTCTCACC 


CTGTACAGGA 


CCGGGGACTA 


CGCCCGGGTT 


CTCGAGGGAT 


1380 




ACGAGTGGCT 


CCTCCAGATG 


AGAACTCCCA 


TCGATTCGTG 


GGGGGTTTCC 


CGCGGGGAAA 


1440 




CGCACATGGC 


CAAGTACACG 


GCAATGGCGA 


TGCTGGCCCT 


CATGCGCGGC 


GAGAACGTGG 


1500 


. F= 


CGAGGGGGCG 


TTACAGGGAT 


GTTCTCAACG 


ACGCCGCGTA 


CTGGTTAATA 


TACAAACAGA 


1560 




ACCCGGACGG 


CTCGTGGGAG 


GACTACACCG 


GAACGGCGCT 


GGCCGTCATC 


GCGCTCGGGG 


1620 




AGTTCCTTAA 


GGG CGGGTAC 


ATCAACGCGA 


ACCTGACCGG 


CTTCAAAAAG 


CAGGTTAAAG 


1680 




AGGCCGTAAA 


CCGCGGGGAA 


GGCTGGCTGA 


TGGATGCGGA 


CCCAAAAACG 


GACGCGGATA 


1740 




GAATATTCGG 


CTACCTCGCC 


CTCGGTAAAA 


AGGACGAACT 


CAAAAAGATG 


AACCCTTCCG 


1800 




GTGACCTGAA 


GGCCTACCGC 


GCCTTTGCAC 


TTGCCTACCT 


CGGGGAGAGG 


GTCGAGCTC 


1859 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
TGTAGTAGTC GTTTATCGGG 2 0 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 14 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



AAGCTTAACA 


TCGAGCGCTC 


CACCTCTAAA 


GTAGGTGAGT 


GTGGATACGA 


AGGTTAGGGC 


60 


CGCTATGACG 


ACCTTCAGGA 


TCCCAACGGC 


TTCTTTTATG 


GGGAGCCCGG 


CGAAGGTGAG 


120 


AATTGAAAGG 


ATTACCATAC 


TCCCTCCGCT 


CATCATGGAG 


CCTATGAATC 


CCCCTCCAAA 


180 


AGAGAGAAGT 


GCTATAAGGA 


GCGTCCTCAT 


GTTCCATGCT 


ATGTTTTGGT 


ATTTAATGCT 


240 


TTTCCGCTTA 


ATGTTACACC 


TCCTCATGAC 


AATTTCGCGT 


TTAGGGATGG 


GGTTAATTGG 


300 


ACCCCTCCGA 


GCCACGGGTT 


GATGTCCATT 


ATGTCGATAT 


TCACCATCTT 


ATCCCCAACT 


360 


TTGTGGGTTT 


CAAACATTAC 


CCTACGTTAT 


ATTTTTATCG 


TCCTAATTAA 


CTGCTGAAAC 


420 


GGGCGCTTAT 


CGTTCATCGT 


TGATGGTTTT 


GGGTGACCGG 


GCATTAAGGA 


ATTGTGTCGT 


480 


TTGCTGAAAT 


TTATGAAACG 


GAGTTGGCTT 


CTTTATGTTA 


CATAAAGATG 


TACATTACTG 


540 


TAATGTATAT 


AAATGGAAGA 


AACACTGTTG 


CGTAAACTTT 


TTAATGTATC 


CAATATCAGT 


600 


ACTTCGATGT 


CCCGATATGG 


GACATGTTGG 


ATAGGAGGGT 


ACTGGAATGA 


AGAGGTTAGG 


660 


TGCTGTGGTG 


CTGGCACTGG 


TGCTCGTGGG 


TCTTCTGGCC 


GGAACGGCCC 


TTGCGGCACC 


720 


CGTAAAACCG 


GTTGTCAGGA 


ACAACGCGGT 


TCAGCAGAAG 


AACTACGGAC 


TGCTGACCCC 


780 


GGGACTGTTC 


AAGAAAGTCC 


AGAGGATGAA 


CTGGAACCAG 


GAAGTGGACA 


CCGTCATAAT 


840 


GTTCGGGAGC 


TACGGAGACA 


GGGACAGGGC 


GGTTAAGGTA 


CTGAGGCTCA 


TGGGCGCCCA 


900 


GGTCAAGTAC 


TCCTACAAGA 


TAATCCCTGC 


TGTCG CGGTT 


AAAATAAAGG 


CCAGGGACCT 


960 


TCTGCTGATC 


GCGGGCATGA 


TAGACACGGG 


TTACTTCGGT 


AACACAAGGG 


TCTCGGGCAT 


1020 


AAAGTTCATA 


CAGGAGGATT 


ACAAGGTTCA 


GGTTGACGAC 


GCCACTTCCG 


TCTCCCAGAT 


1080 


AGGGGCCGAT 


ACCGTCTGGA 


ACTCCCTCGG 


CTACGACGGA 


AGCGGTGTGG 


TGGTTGC CAT 


1140 


CGTCGATACG 


GGTATAGACG 


CGAACCACCC 


CGATCTGAAG 


GGCAAGGTCA 


TAGGCTGGTA 


1200 


CGACTCCGTC 


AACGGCAGGT 


CGACCCCCTA 


CGATGACCAG 


GGACACGGAA 


CCCACGTTGC 


1260 


GGGTATCGTT 


GCCGGAACCG 


GG AG CGTTAA 


CTCCCAGTAC 


ATAGGCGTCG 


GCCCCGGCGC 


1320 


GAAGCTCGTC 


GGCGTCAAGG 


TTCTCGGTTC 


CGACGGTTCG 


GGAAGCGTCT 


CCACCATCAT 


1380 


CGCGGGTGTT 


GACTGGAACG 


TCCAGAACTA 


GGACAAGTAC 


GGGATAAGGG 


TCATCAACCT 


1440 


CTCCCTCGGC 


TCCTCCCAGA 


GCTC 








1464 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AAAAGAATTC GGATCCATGA AGAGGTTAGG TGC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TTTTATCGAT CAGGCGTCCC AGGCGTTG 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CATTATAGGT AAGAGAGGAA TG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GATCCATTCC TCTCTTACCT AT AATG G T AC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



TAGCAGTAAT TGACACGGG 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TAGCAGTAAT TGACACTGG 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTGTTCCAGC TACGTGAGTT CC 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CTGTTCCAGC TACATGAGTT CC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

A CTA GTC ATC TCA GGT TTA ACA GGG GGT AAA GCT AAG CTT TCA GGT 
Leu Val lie Ser Gly Leu Thr Gly Gly Lys Ala Lys Leu Ser Gly 

5 10 15 

GTT AGG TTT ATC CAG GAA GAC TAC AAA GTT ACA GTT TCA GCA GAA 
Val Arg Phe He Gin Glu Asp Tyr Lys Val Thr Val Ser Ala Glu 
20 25 30 



TTA GAA GGA CTG GAT GAG TCT GCA GCT CAA GTT ATG GCA ACT TAC 
Leu Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala Thr Tyr 
35 40 45 
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GTT TGG AAC TTG GGA 
Val Trp Asn Leu Gly 
50 

ATT GAC ACT GGA ATT 
lie Asp Thr Gly lie 
65 

GTA ATT GGG TGG GTA 
Val lie Gly Trp Val 
80 

GAT GAC CAT GGA CAT 
Asp Asp His Gly His 
95 

ACT GGA GCA GCA AGT 
Thr Gly Ala Ala Ser 
110 

GCT AAG CTG GCG GGA 
Ala Lys Leu Ala Gly 
125 

AGC ATA TCT ACT ATA 
Ser lie Ser Thr lie 
140 

AAA GAT AAG TAC GGA 
Lys Asp Lys Tyr Gly 
155 

AGC CAG AGC TC 
Ser Gin Ser 



TAT GAT GGT TCT GGA 
Tyr Asp Gly Ser Gly 
55 

GAC GCT TCT CAT CCA 
Asp Ala Ser His Pro 
70 

GAT TTT GTC AAT GGT 
Asp Phe Val Asn Gly 
85 

GGA ACT CAT GTA GCT 
Gly Thr His Val Ala 
100 

AAT GGC AAG TAC AAG 
Asn Gly Lys Tyr Lys 
115 

ATT AAG GTT CTA GGT 
lie Lys Val Leu Gly 
130 

ATT AAG GGA GTT GAG 
lie Lys Gly Val Glu 
145 

ATT AAG GTC ATT AAT 
lie Lys Val lie Asn 
160 



ATC ACA ATA GGA ATA 
lie Thr lie Gly lie 
60 

GAT CTC CAA GGA AAA 
Asp Leu Gin Gly Lys 
75 

AGG AGT TAT CCA TAC 
Arg Ser Tyr Pro Tyr 
90 

TCA ATA GCA GCT GGT 
Ser He Ala Ala Gly 
105 

GGA ATG GCT CCA GGA 
Gly Met Ala Pro Gly 
120 

GCC GAT GGT TCT GGA 
Ala Asp Gly Ser Gly 
135 

TGG GCC GTT GAT AAC 
Trp Ala Val Asp Asn 
150 

CTT TCT CTT GGT TCA 
Leu Ser Leu Gly Ser 
165 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TG AC AC TGG A ATTGACGCTT CTCATCCAGA 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TCTCCAAGGA AAAGTAATTG GGTGGGTAGA 



(2) INFORMATION FOR SEQ ID NO:29: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTTGCCATAA CTTGAGCTGC AGACTCATCC 30 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 30: 

TTTATTAAGC ATAAAATAGC CATGCAACTT TGATCACTAA TGTGCGGTGG TGCAC ATG 58 

Met 

AAG GGG CTG AAA GCT CTC ATA TTA GTG ATT TTA GTT CTA GGT TTG 103 
Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly Leu 
5 10 15 

GTA GTA GGG AGC GTA GCG GCA GCT CCA GAG AAG AAA GTT GTT CAA 14 8 

Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Val Gin 
20 25 30 

GTA AGA AAT GTT GAG AAG AAC TAT GGT CTG CTA ACG CCA GGA CTG 193 
Val Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly Leu 
35 40 45 

TTC AGA AAA ATT CCC AAA TTG GAT CCT AAC GAG GGA ATC AGC ACA 23 8 

Phe Arg Lys lie Pro Lys Leu Asp Pro Asn Glu Gly lie Ser Thr 
50 55 60 

GTA ATT GTA TTT GTT AAC CAT AGG GGA AAA GAA ATT GCA GTA AGA 283 
Val lie Val Phe Val Asn His Arg Gly Lys Glu lie Ala Val Arg 
65 70 75 

GTT CTT GAG TTA ATG GGT GCC CAA GTT AGG TAT GTG TAC CAT ATT 328 
Val Leu Glu Leu Met Gly Ala Gin Val Arg Tyr Val Tyr His lie 
80 85 90 

ATA CCC CCA ATA GCT GCC GAT CTT AAG GTT AGA GAC TTA CTA GTC 373 
lie Pro Pro lie Ala Ala Asp Leu Lys Val Arg Asp Leu Leu Val 
95 100 105 

ATC TCA GGT TTA ACA GGG GGT GAA ACT AAG CTT TCA GGT GTT AGG T 419 
lie Ser Gly Leu Thr Gly Gly Glu Thr Lys Leu Ser Gly Val Arg 
110 115 120 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCTCTAGACT CTGGGAGGAG TAGTTATACT TGATGAAGCC TATTCTGAGT TTTCGGGAAA 
AAG CTTCATA CCAAAAATCA GTGAGTATGA AAATTTAGTA ATTCTAAGGA CGTTTTCAAA 
GGCGTTTGGA CTTGCTGGAA TTAGATGTGG ATATATGATA GCAAATGAAA AGATTATAGA 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
AGAGGGATCC ATGAAGGGGC TGAAAGCT 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
AGAGGCATGC GCTCTAGACT CTGGGAGAGT 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) .TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ATGAAGGGGC TGAAAGCTCT CATATTAGTG ATTTTAGTTC TAGGTTTGGT AGTAGGGAGC 
GT AG CGGC AG CTCCAGAGAA GAAAGTTGAA CAAGTAAGAA ATGTTGAGAA GAACTATGGT 
CTGCTAACGC CAGGACTGTT CAGAAAAATT CAAAAATTGA ATCCTAACGA GGAAATCAGC 
ACAGTAATTG TATTTGAAAA CCATAGGGAA AAAGAAATTG CAGTAAGAGT TCTTGAGTTA 
ATGGGTGCAA AAGTTAGGTA TGTGTACCAT ATTATACCCG CAATAGCTGC CGATCTTAAG 
GTTAGAGACT TACTAGTCAT CTCAGGTTTA ACAGGGGGTA AAGCTAAGCT TTCAGGTGTT 
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AGGTTTATCC 


AGGAAGACTA 


. CAAAGTTACA 


GTTTCAGCAG 


AATTAGAAGG 


ACTGGATGAG 


420 


TCTGCAGCTC 


AAGTTATGGC 


AACTTACGTT 


TGGAACTTGG 


GATATGATGG 


TTCTGGAATC 


480 


ACAATAGGAA 


TAATTGACAC 


TGGAATTGAC 


GCTTCTCATC 


CAGATCTCCA 


AGGAAAAGTA 


540 


ATTGGGTGGG 


TAGATTTTGT 


CAATGGTAGG 


AGTTATCCAT 


ACGATGACCA 


TGGACATGGA 


600 


ACTCATGTAG 


CTTCAATAGC 


AGCTGGTACT 


GGAGCAGCAA 


GTAATGGCAA 


GTACAAGGGA 


660 


ATGGCTCCAG 


GAGCTAAGCT 


GGCGGGAATT 


AAGGTTCTAG 


GTGCCGATGG 


TTCTGGAAGC 


720 


ATATCTACTA 


TAATTAAGGG 


AGTTGAGTGG 


GCCGTTGATA 


ACAAAGATAA 


GTACGGAATT 


780 


AAGGTCATTA 


ATCTTTCTCT 


TGGTTCAAGC 


CAGAGCTCAG 


ATGGTACTGA 


CGCTCTAAGT 


840 


CAGGCTGTTA 


ATGCAGCGTG 


GGATGCTGGA 


TTAGTTGTTG 


TGGTTGCCGC 


TGGAAACAGT 


900 


GGACCTAACA 


AGTATACAAT 


CGGTTCTCCA 


GCAGCTGCAA 


G C AAAGTT AT 


TACAGTTGGA 


960 


GCCGTTGACA 


AGTATGATGT 


TATAACAAGC 


TTCTCAAGCA 


GAGGGCCAAC 


TGCAGACGGC 


1020 


AGGCTTAAGC 


CTGAGGTTGT 


TGCTCCAGGA 


AACTGGATAA 


TTGCTGC CAG 


AGCAAGTGGA 


1080 


ACTAGCATGG 


GTCAACCAAT 


TAATGACTAT 


TACACAGCAG 


CTCCTGGGAC 


ATCAATGGCA 


1140 


ACTCCTCACG 


TAGCTGGTAT 


TGCAGCCCTC 


TTGCTCCAAG 


CACACCCGAG 


CTGGACTCCA 


1200 


GACAAAGTAA 


AAACAGCCCT 


CATAGAAACT 


GCTGATATCG 


TAAAGCCAGA 


TGAAATAGCC 


1260 


GATATAGCCT 


ACGGTGCAGG 


TAGGGTTAAT 


GCATACAAGG 


CTATAAACTA 


CGATAACTAT 


1320 


GCAAAGCTAG 


TGTTCACTGG 


ATATGTTGCC 


AACAAAGGCA 


GCCAAACTCA 


CCAGTTCGTT 


1380 


ATT AG CGGAG 


CTTCGTTCGT 


AACTGCCACA 


TTATACTGGG 


ACAATGCCAA 


TAGCGACCTT 


1440 


GATCTTTACC 


TCTACGATCC 


CAATGGAAAC 


CAGGTTGACT 


ACTCTTACAC 


CGCCTACTAT 


1500 


GGATTCGAAA 


AGGTTGGTTA 


TTACAACCCA 


ACTGATGGAA 


CATGGACAAT 


TAAGGTTGTA 


1560 


AGCTACAGCG 


GAAGTGCAAA 


CTATCAAGTA 


GATGTGGTAA 


GTGATGGTTC 


CCTTTCACAG 


1620 


CCTGGAAGTT 


CACCATCTCC 


ACAACCAGAA 


CCAACAGTAG 


ACGCAAAGAC 


GTTC CAAGG A 


1680 


TCCGATCACT 


ACTACTATGA 


CAGGAGCGAC 


ACCTTTACAA 


TGACCGTTAA 


CTCTGGGGCT 


1740 


ACAAAGATTA 


CTGGAGACCT 


AGTGTTTGAC 


ACAAGCTACC 


ATGATCTTGA 


CCTTTACCTC 


1800 


TACGATCCTA 


ACCAGAAGCT 


TGTAGATAGA 


TCGGAGAGTC 


CCAACAGCTA 


CGAACACGTA 


1860 


GAATACTTAA 


CCCCCGCCCC 


AGGAACCTGG 


TACTTCCTAG 


TATATGCCTA 


CTACACTTAC 


1920 


GGTTGGGCTT . 


ACTACGAGCT 


GACGGCTAAA 


GTTTATTATG 


GC 




1962 



(2) INFORMATION FOR SEQ ID NO : 3 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
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Met Lys Gly Leu Lys Ala Leu lie Leu Val lie Leu Val Leu Gly 

5 10 15 

Leu Val Val Gly Ser Val Ala Ala Ala Pro Glu Lys Lys Val Glu 
20 25 30 

Gin Val Arg Asn Val Glu Lys Asn Tyr Gly Leu Leu Thr Pro Gly 

35 40 45 

Leu Phe Arg Lys lie Gin Lys Leu Asn Pro Asn Glu Glu lie Ser 

50 55 60 

Thr Val lie Val Phe Glu Asn His Arg Glu Lys Glu lie Ala Val 

65 70 75 

Arg Val Leu Glu Leu Met Gly Ala Lys Val Arg Tyr Val Tyr His 

80 85 90 

lie lie Pro Ala lie Ala Ala Asp Leu Lys Val Arg Asp Leu Leu 
95 100 105 

Val lie Ser Gly Leu Thr Gly Gly Lys Ala Lys Leu Ser Gly Val 
110 115 120 

Arg Phe lie Gin Glu Asp Tyr Lys Val Thr Val Ser Ala Glu Leu 
125 130 135 

Glu Gly Leu Asp Glu Ser Ala Ala Gin Val Met Ala Thr Tyr Val 
140 145 150 

Trp Asn Leu Gly Tyr Asp Gly Ser Gly lie Thr lie Gly lie He 
155 160 165 

Asp Thr Gly lie Asp Ala Ser His Pro Asp Leu Gin Gly Lys Val 
170 175 180 

He Gly Trp Val Asp Phe Val Asn Gly Arg Ser Tyr Pro Tyr Asp 
185 190 195 

Asp His Gly His Gly Thr His Val Ala Ser He Ala Ala Gly Thr 
200 205 210 

Gly Ala Ala Ser Asn Gly Lys Tyr Lys Gly Met Ala Pro Gly Ala 
215 220 225 

Lys Leu Ala Gly He Lys Val Leu Gly Ala Asp Gly Ser Gly Ser 
230 235 240 

He Ser Thr He He Lys Gly Val Glu Trp Ala Val Asp Asn Lys 
245 250 255 

Asp Lys Tyr Gly He Lys Val He Asn Leu Ser Leu Gly Ser Ser 
260 265 270 

Gin Ser Ser Asp Gly Thr Asp Ala Leu Ser Gin Ala Val Asn Ala 
275 280 285 

Ala Trp Asp Ala Gly Leu Val Val Val Val Ala Ala Gly Asn Ser 
290 295 300 

Gly Pro Asn Lys Tyr Thr He Gly Ser Pro Ala Ala Ala Ser Lys 
305 310 315 

Val He Thr Val Gly Ala Val Asp Lys Tyr Asp Val He Thr Ser 
320 325 330 



Phe Ser Ser Arg Gly Pro Thr Ala Asp Gly Arg Leu Lys Pro Glu 
335 340 345 

Val Val Ala Pro Gly Asn Trp lie lie Ala Ala Arg Ala Ser Gly 
350 355 360 

Thr Ser Met Gly Gin Pro lie Asn Asp Tyr Tyr Thr Ala Ala Pro 
365 370 375 

Gly Thr Ser Met Ala Thr Pro His Val Ala Gly lie Ala Ala Leu 
380 385 390 

Leu Leu Gin Ala His Pro Ser Trp Thr Pro Asp Lys Val Lys Thr 
395 400 405 

Ala Leu lie Glu Thr Ala Asp lie Val Lys Pro Asp Glu lie Ala 
410 415 420 

Asp lie Ala Tyr Gly Ala Gly Arg Val Asn Ala Tyr Lys Ala lie 
425 430 435 

Asn Tyr Asp Asn Tyr Ala Lys Leu Val Phe Thr Gly Tyr Val Ala 
440 445 450 

Asn Lys Gly Ser Gin Thr His Gin Phe Val lie Ser Gly Ala Ser 
455 460 465 

Phe Val Thr Ala Thr Leu Tyr Trp Asp Asn Ala Asn Ser Asp Leu 
470 475 480 

Asp Leu Tyr Leu Tyr Asp Pro Asn Gly Asn Gin Val Asp Tyr Ser 
485 490 495 

Tyr Thr Ala Tyr Tyr Gly Phe Glu Lys Val Gly Tyr Tyr Asn Pro 
500 505 510 

Thr Asp Gly Thr Trp Thr lie Lys Val Val Ser Tyr Ser Gly Ser 
515 520 525 

Ala Asn Tyr Gin Val Asp Val Val Ser Asp Gly Ser Leu Ser Gin 
530 535 540 

Pro Gly Ser Ser Pro Ser Pro Gin Pro Glu Pro Thr Val Asp Ala 
545 550 . 555 

Lys Thr Phe Gin Gly Ser Asp His Tyr Tyr Tyr Asp Arg Ser Asp 
560 565 570 

Thr Phe Thr Met Thr Val Asn Ser Gly Ala Thr Lys lie Thr Gly 
575 580 585 

Asp* Leu Val Phe Asp Thr Ser Tyr His Asp Leu Asp Leu Tyr Leu 
590 595 600 

Tyr Asp Pro Asn Gin Lys Leu Val Asp Arg Ser Glu Ser Pro Asn 
605 610 615 

Ser Tyr Glu His Val Glu Tyr Leu Thr Pro Ala Pro Gly Thr Trp 
620 625 630 

Tyr Phe Leu Val Tyr Ala Tyr Tyr Thr Tyr Gly Trp Ala Tyr Tyr 
635 640 645 

Glu Leu Thr Ala Lys Val Tyr Tyr Gly 
650 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) \ LENGTH: 25 base pairs 

(B) \TYPE: nucleic acid 

(C) ^TRANDEDNESS : double 

(d) Topology : linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
TCTGAATTCG TTCTTTTCTG TATGG 



25 



(2) INFORMATION FOR SEQ ID NO: 3 7 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH^ 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEjpNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE\ genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:37: 
TGTACTGCTG GATCCGGCAG 



20 



(2) INFORMATION FOR SEQ lb NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 0 bkse pairs 

(B) TYPE: nuclei* acid 

(C) STRANDEDNESS* double 

(D) TOPOLOGY: linWr 

(ii) MOLECULE TYPE: gendmic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38 
GGATC CATCA GATTTTTGAG TGTAGA*4CAA CCAGTATGCT GCATTTGTAA TTGTGAGATA 60 




ATATCTCCCG CGGGTAAGGT 



80 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base p$irs 

(B) TYPE: nucleic acic 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic dVja 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AGAGGCATGC GTATC CATCA GATTTTTGAG 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 



